External Correspondence: Decompositions of the Mean Probability Score
نویسنده
چکیده
Two evaluative criteria for probabilistic forecasting performance, consistency with the axioms of probability theory and external correspondence with the events that ultimately occur, are distinguished. The mean probability, or Brier score (PS), is the scoring rule most commonly used to quantify external correspondence. A review is made of methods for decomposing PS into components that represent distinct and important aspects of external correspondence. Data from an empirical study of forecasting performance are used to illustrate the interpretation of the components of the most recent decomposition of PS (J. F. Yates, Forecasting performance: A covariance decomposition of the mean probability score. Paper presented at 22nd Annual Meeting of the Psychonomic Society, Philadelphia, November 1981; also an unpublished manuscript). Substantively, the most important finding of the study was a "collapsing" tendency in forecasting behavior, whereby subjects were inclined to report forecasts of .5 when they felt they knew little about the event in question. This finding is problematic because self-reported knowledge was only minimally related to the actual external correspondence of the subjects' forecasts. A survey of uses of PS decompositions suggests, among other things, that current research typically emphasizes calibration, perhaps to the neglect of other, more important dimensions of external correspondence.
منابع مشابه
Loss Functions for Binary Class Probability Estimation and Classification: Structure and Applications
What are the natural loss functions or fitting criteria for binary class probability estimation? This question has a simple answer: so-called “proper scoring rules”, that is, functions that score probability estimates in view of data in a Fisher-consistent manner. Proper scoring rules comprise most loss functions currently in use: log-loss, squared error loss, boosting loss, and as limiting cas...
متن کاملDelphi application in solicitation of qualitative risk factors for estimation of a perceived probability of default: Case of Karafarin Bank
Unreliability of financial statements in Iran has urged this country’s financial services industry management to manipulate practices by which they could gain reliable risk scores for borrowers. This research extracts the most influential qualitative factors that would impact the default of a business relationship borrower. Solicitation of the factors is done through Delphi methodology. The mea...
متن کاملEvaluating and managing the probability of medical errors in nursing personnel using the HEART method
Introduction: Medical errors cause serious and often preventable injuries to patients. Studying human errors and their use as an opportunity for learning is a key factor in the effort to improve patient safety and quality of care in the hospitals. The purpose of this study was to identify and evaluate human errors to reduce their risks in nursing personnel using the Human Error Evaluation and R...
متن کاملDetermining Varying Usage of Sources of Information among Different Involvement Groups
This study tries to investigate the difference in usage of sources of information by the consumers for FMCG products when they are segregated into homogeneous groups. Mean value is calculated for each group which is ranked to identify the source of information more often used by each group. Further on, the one-way ANOVA (F-test) is performed on score values provided by the respondents to find ...
متن کاملبررسی ابعاد ظاهری دنبه و ارتباط آنها با وزن دنبه در گوسفند نژاد لری بختیاری
In this study external fat-tail dimensions (upper, middle and lower width, length, length of gap, depth and upper circumference) and fat-tail weights collected on 724 Lori-Bakhtiari sheep were used to study external fat-tail dimensions and their relationships with fat-tail weights. Sheep were 3 months to 6 years old and slaughtered at the industrial slaughter house of Joneghan in Chaharmohal an...
متن کامل